| Automatic Goal Generation using Dynamical Distance Learning | Mar 9, 2021 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games | Mar 8, 2021 | counterfactualDecision Making | —Unverified | 0 |
| Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games | Mar 8, 2021 | counterfactualDecision Making | —Unverified | 0 |
| Adversarial Environment Generation for Learning to Navigate the Web | Mar 2, 2021 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Batched Neural Bandits | Feb 25, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Hyperparameter Transfer Learning with Adaptive Complexity | Feb 25, 2021 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning | Feb 22, 2021 | Decision MakingDistributional Reinforcement Learning | —Unverified | 0 |
| Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models | Feb 16, 2021 | Decision MakingMeta Reinforcement Learning | —Unverified | 0 |
| Causal Markov Decision Processes: Learning Good Interventions Efficiently | Feb 15, 2021 | Decision MakingMarketing | —Unverified | 0 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games | Feb 13, 2021 | counterfactualDecision Making | CodeCode Available | 0 |
| Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module | Feb 11, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Representation Matters: Offline Pretraining for Sequential Decision Making | Feb 11, 2021 | Decision MakingImitation Learning | —Unverified | 0 |
| Patterns, predictions, and actions: A story about machine learning | Feb 10, 2021 | BIG-bench Machine LearningCausal Inference | —Unverified | 0 |
| An Analysis of Frame-skipping in Reinforcement Learning | Feb 7, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | Feb 6, 2021 | Decision MakingManagement | —Unverified | 0 |
| Improving Human Decision-Making by Discovering Efficient Strategies for Hierarchical Planning | Jan 31, 2021 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Reinforcement Learning for Freight Booking Control Problems | Jan 29, 2021 | BIG-bench Machine LearningDecision Making | —Unverified | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors | Jan 26, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| High-Confidence Off-Policy (or Counterfactual) Variance Estimation | Jan 25, 2021 | counterfactualDecision Making | —Unverified | 0 |
| GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning | Jan 24, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deciding What to Learn: A Rate-Distortion Approach | Jan 15, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reinforced Imitative Graph Representation Learning for Mobile User Profiling: An Adversarial Training Perspective | Jan 7, 2021 | Decision MakingGraph Representation Learning | —Unverified | 0 |
| Divide-and-Conquer Monte Carlo Tree Search | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Computing Preimages of Deep Neural Networks with Applications to Safety | Jan 1, 2021 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Learning to Make Decisions via Submodular Regularization | Jan 1, 2021 | Active LearningBayesian Optimization | —Unverified | 0 |
| Learning to Recover from Failures using Memory | Jan 1, 2021 | Decision MakingMeta-Learning | —Unverified | 0 |
| Understanding and Leveraging Causal Relations in Deep Reinforcement Learning | Jan 1, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients | Dec 30, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Regret bound for Non-stationary Multi-Armed Bandits with Fairness Constraints | Dec 24, 2020 | Decision MakingFairness | —Unverified | 0 |
| Autonomous Charging of Electric Vehicle Fleets to Enhance Renewable Generation Dispatchability | Dec 22, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Off-Policy Optimization of Portfolio Allocation Policies under Constraints | Dec 21, 2020 | Decision MakingPortfolio Optimization | CodeCode Available | 0 |
| Learning Mobile Robot Navigation in the Dense Crowd with Deep Reinforcement Learning | Dec 14, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Demystify Painting with RL | Dec 14, 2020 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Hindsight and Sequential Rationality of Correlated Play | Dec 10, 2020 | counterfactualDecision Making | CodeCode Available | 0 |
| Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes | Dec 1, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Planning with General Objective Functions: Going Beyond Total Rewards | Dec 1, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Delay and Cooperation in Nonstochastic Linear Bandits | Dec 1, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| On Efficiency in Hierarchical Reinforcement Learning | Dec 1, 2020 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Improving Online Rent-or-Buy Algorithms with Sequential Decision Making and ML Predictions | Dec 1, 2020 | BIG-bench Machine LearningDecision Making | —Unverified | 0 |
| R-learning in actor-critic model offers a biologically relevant mechanism for sequential decision-making | Dec 1, 2020 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization | Nov 18, 2020 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Modality-Buffet for Real-Time Object Detection | Nov 17, 2020 | Decision MakingObject | —Unverified | 0 |
| A New Bandit Setting Balancing Information from State Evolution and Corrupted Context | Nov 16, 2020 | Decision MakingEfficient Exploration | CodeCode Available | 0 |
| Robust Batch Policy Learning in Markov Decision Processes | Nov 9, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reliable Off-policy Evaluation for Reinforcement Learning | Nov 8, 2020 | Decision MakingOff-policy evaluation | —Unverified | 0 |
| Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial | Nov 6, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Loss Bounds for Approximate Influence-Based Abstraction | Nov 3, 2020 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Reinforcement Learning with Efficient Active Feature Acquisition | Nov 2, 2020 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach | Nov 2, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |