| Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes | Jan 29, 2022 | Decision Making, Model-based Reinforcement Learning | Code Available | 0 |
| Accelerate Model Parallel Training by Using Efficient Graph Traversal Order in Device Placement | Jan 21, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making | Jan 20, 2022 | counterfactual, Decision Making | Unverified | 0 |
| Active Learning-Based Multistage Sequential Decision-Making Model with Application on Common Bile Duct Stone Evaluation | Jan 13, 2022 | Active Learning, Decision Making | Unverified | 0 |
| Automated Reinforcement Learning: An Overview | Jan 13, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning | Jan 12, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Subgoal-Based Explanations for Unreliable Intelligent Decision Support Systems | Jan 11, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| State of the Art of User Simulation approaches for conversational information retrieval | Jan 10, 2022 | Decision Making, Information Retrieval | Unverified | 0 |
| Temporal Detection of Anomalies via Actor-Critic Based Controlled Sensing | Jan 3, 2022 | Anomaly Detection, Decision Making | Unverified | 0 |
| Socially-Optimal Mechanism Design for Incentivized Online Learning | Dec 29, 2021 | Decision Making, Edge-computing | Unverified | 0 |
| Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions | Dec 26, 2021 | Decision Making, General Reinforcement Learning | Unverified | 0 |
| A Survey on Interpretable Reinforcement Learning | Dec 24, 2021 | Autonomous Driving, Decision Making | Unverified | 0 |
| Differentially Private Regret Minimization in Episodic Markov Decision Processes | Dec 20, 2021 | Decision Making, Reinforcement Learning (RL) | Code Available | 0 |
| Revisiting Game Representations: The Hidden Costs of Efficiency in Sequential Decision-making Algorithms | Dec 20, 2021 | counterfactual, Decision Making | Unverified | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | Dec 8, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach | Dec 7, 2021 | Decision Making, reinforcement-learning | Unverified | 0 |
| MDPFuzz: Testing Models Solving Markov Decision Processes | Dec 6, 2021 | Autonomous Driving, Collision Avoidance | Unverified | 0 |
| Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning | Dec 6, 2021 | Causal Discovery, Decision Making | Unverified | 0 |
| Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification | Dec 1, 2021 | Decision Making, Diagnostic | Code Available | 1 |
| Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions | Nov 29, 2021 | Contrastive Learning, Decision Making | Unverified | 0 |
| Pessimistic Model Selection for Offline Deep Reinforcement Learning | Nov 29, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation | Nov 28, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Neural Column Generation for Capacitated Vehicle Routing | Nov 24, 2021 | Decision Making, Imitation Learning | Unverified | 0 |
| Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability | Nov 24, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Adversarial Deep Learning for Online Resource Allocation | Nov 19, 2021 | Decision Making, Deep Learning | Unverified | 0 |
| Deep Reinforcement Learning for Entity Alignment | Nov 16, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Route Optimization via Environment-Aware Deep Network and Reinforcement Learning | Nov 16, 2021 | Decision Making, reinforcement-learning | Unverified | 0 |
| AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive Crossbars | Nov 15, 2021 | CPU, Decision Making | Code Available | 0 |
| Automatic Goal Generation using Dynamical Distance Learning | Nov 7, 2021 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| SOPE: Spectrum of Off-Policy Estimators | Nov 6, 2021 | Decision Making, Off-policy evaluation | Code Available | 0 |
| Regular Decision Processes for Grid Worlds | Nov 5, 2021 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning | Nov 4, 2021 | Decision Making, Imitation Learning | Code Available | 1 |
| Partial-Adaptive Submodular Maximization | Nov 1, 2021 | Active Learning, Decision Making | Unverified | 0 |
| A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning | Oct 27, 2021 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning | Oct 27, 2021 | Decision Making, Imitation Learning | Code Available | 1 |
| The Value of Information When Deciding What to Learn | Oct 26, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Dynamic Causal Bayesian Optimization | Oct 26, 2021 | Bayesian Optimization, Causal Inference | Code Available | 1 |
| HSVI for zs-POSGs using Concavity, Convexity and Lipschitz Properties | Oct 25, 2021 | Decision Making, Heuristic Search | Unverified | 0 |
| Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models | Oct 22, 2021 | counterfactual, Decision Making | Code Available | 0 |
| Anti-Concentrated Confidence Bonuses for Scalable Exploration | Oct 21, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations | Oct 19, 2021 | Decision Making, Model Selection | Code Available | 0 |
| SS-MAIL: Self-Supervised Multi-Agent Imitation Learning | Oct 18, 2021 | Decision Making, Imitation Learning | Unverified | 0 |
| Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network | Oct 16, 2021 | Behavioural cloning, Decision Making | Unverified | 0 |
| Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning | Oct 9, 2021 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits | Oct 8, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Medical Dead-ends and Learning to Identify High-risk States and Treatments | Oct 8, 2021 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations | Oct 6, 2021 | Decision Making, Navigate | Unverified | 0 |
| Gambits: Theory and Evidence | Oct 5, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams | Oct 2, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |