| The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation | Jun 8, 2021 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Curriculum Design for Teaching via Demonstrations: Theory and Applications | Jun 8, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Meta-Learning Reliable Priors in the Function Space | Jun 6, 2021 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Heuristic-Guided Reinforcement Learning | Jun 5, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Be Considerate: Objectives, Side Effects, and Deciding How to Act | Jun 4, 2021 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Learning to schedule job-shop problems: Representation and policy learning using graph neural network and reinforcement learning | Jun 2, 2021 | Decision MakingGraph Neural Network | —Unverified | 0 |
| Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning | May 21, 2021 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning | May 21, 2021 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Robust optimal policies for team Markov games | May 16, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Bandit based centralized matching in two-sided markets for peer to peer lending | May 6, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Data-Efficient Reinforcement Learning for Malaria Control | May 4, 2021 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization | May 1, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Statistical Inference with M-Estimators on Adaptively Collected Data | Apr 29, 2021 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Universal Off-Policy Evaluation | Apr 26, 2021 | counterfactualDecision Making | CodeCode Available | 0 |
| Reinforcement Learning using Guided Observability | Apr 22, 2021 | Decision MakingMuJoCo | —Unverified | 0 |
| Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem | Apr 22, 2021 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | Apr 20, 2021 | ClusteringDecision Making | —Unverified | 0 |
| Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images | Apr 14, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning | Apr 9, 2021 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Ecole: A Library for Learning Inside MILP Solvers | Apr 6, 2021 | BIG-bench Machine LearningCombinatorial Optimization | CodeCode Available | 0 |
| Learning-Based UAV Trajectory Optimization with Collision Avoidance and Connectivity Constraints | Apr 3, 2021 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Online Convex Optimization with Continuous Switching Constraint | Mar 21, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Forward and Backward Bellman equations improve the efficiency of EM algorithm for DEC-POMDP | Mar 19, 2021 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Situated Language Learning via Interactive Narratives | Mar 18, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Encrypted Linear Contextual Bandit | Mar 17, 2021 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Meta-Learning for Planning: Automatic Synthesis of Sample Based Planners | Mar 13, 2021 | Decision MakingMeta-Learning | —Unverified | 0 |
| Optimal sequential decision making with probabilistic digital twins | Mar 12, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Automatic Goal Generation using Dynamical Distance Learning | Mar 9, 2021 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games | Mar 8, 2021 | counterfactualDecision Making | —Unverified | 0 |
| Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games | Mar 8, 2021 | counterfactualDecision Making | —Unverified | 0 |
| Adversarial Environment Generation for Learning to Navigate the Web | Mar 2, 2021 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Batched Neural Bandits | Feb 25, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Hyperparameter Transfer Learning with Adaptive Complexity | Feb 25, 2021 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model | Feb 23, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 1 |
| SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning | Feb 22, 2021 | Decision MakingDistributional Reinforcement Learning | —Unverified | 0 |
| Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models | Feb 16, 2021 | Decision MakingMeta Reinforcement Learning | —Unverified | 0 |
| Causal Markov Decision Processes: Learning Good Interventions Efficiently | Feb 15, 2021 | Decision MakingMarketing | —Unverified | 0 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games | Feb 13, 2021 | counterfactualDecision Making | CodeCode Available | 0 |
| Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module | Feb 11, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Representation Matters: Offline Pretraining for Sequential Decision Making | Feb 11, 2021 | Decision MakingImitation Learning | —Unverified | 0 |
| Patterns, predictions, and actions: A story about machine learning | Feb 10, 2021 | BIG-bench Machine LearningCausal Inference | —Unverified | 0 |
| An Analysis of Frame-skipping in Reinforcement Learning | Feb 7, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | Feb 6, 2021 | Decision MakingManagement | —Unverified | 0 |
| Improving Human Decision-Making by Discovering Efficient Strategies for Hierarchical Planning | Jan 31, 2021 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Reinforcement Learning for Freight Booking Control Problems | Jan 29, 2021 | BIG-bench Machine LearningDecision Making | —Unverified | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors | Jan 26, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| High-Confidence Off-Policy (or Counterfactual) Variance Estimation | Jan 25, 2021 | counterfactualDecision Making | —Unverified | 0 |
| GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning | Jan 24, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| An empirical evaluation of active inference in multi-armed bandits | Jan 21, 2021 | BIG-bench Machine LearningDecision Making | CodeCode Available | 1 |