| Joint AP Probing and Scheduling: A Contextual Bandit Approach | Aug 6, 2021 | Decision Making, Scheduling | Unverified | 0 |
| RLTutor: Reinforcement Learning Based Adaptive Tutoring System by Modeling Virtual Student with Fewer Interactions | Jul 31, 2021 | Decision Making, reinforcement-learning | Code Available | 0 |
| Lyapunov-based uncertainty-aware safe reinforcement learning | Jul 29, 2021 | Autonomous Driving, Decision Making | Unverified | 0 |
| On Blame Attribution for Accountable Multi-Agent Sequential Decision Making | Jul 26, 2021 | Decision Making, Fairness | Unverified | 0 |
| Robust Adaptive Submodular Maximization | Jul 23, 2021 | Active Learning, Decision Making | Unverified | 0 |
| Reinforcement Learning Agent Training with Goals for Real World Tasks | Jul 21, 2021 | Decision Making, reinforcement-learning | Unverified | 0 |
| High-Accuracy Model-Based Reinforcement Learning, a Survey | Jul 17, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks | Jul 13, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Metalearning Linear Bandits by Prior Update | Jul 12, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Neural Contextual Bandits without Regret | Jul 7, 2021 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Bayesian decision-making under misspecified priors with applications to meta-learning | Jul 3, 2021 | Decision Making, Meta-Learning | Unverified | 0 |
| Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow | Jul 1, 2021 | Decision Making, Marketing | Unverified | 0 |
| Bounded rationality for relaxing best response and mutual consistency: The Quantal Hierarchy model of decision-making | Jun 30, 2021 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Decision making with dynamic probabilistic forecasts | Jun 30, 2021 | Decision Making, energy trading | Unverified | 0 |
| UAV-assisted Online Machine Learning over Multi-Tiered Networks: A Hierarchical Nested Personalized Federated Learning Approach | Jun 29, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Action Set Based Policy Optimization for Safe Power Grid Management | Jun 29, 2021 | Decision Making, Management | Unverified | 0 |
| A Reinforcement Learning Approach for Sequential Spatial Transformer Networks | Jun 27, 2021 | Decision Making, reinforcement-learning | Unverified | 0 |
| Building Intelligent Autonomous Navigation Agents | Jun 25, 2021 | Autonomous Navigation, Decision Making | Unverified | 0 |
| Not all users are the same: Providing personalized explanations for sequential decision making problems | Jun 23, 2021 | All, Clustering | Unverified | 0 |
| Lorenz System State Stability Identification using Neural Networks | Jun 16, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Probabilistic DAG Search | Jun 16, 2021 | Decision Making, feature selection | Unverified | 0 |
| Robust Reinforcement Learning Under Minimax Regret for Green Security | Jun 15, 2021 | Decision Making, reinforcement-learning | Code Available | 0 |
| A modular framework for object-based saccadic decisions in dynamic scenes | Jun 10, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Information Avoidance and Overvaluation in Sequential Decision Making under Epistemic Constraints | Jun 9, 2021 | Decision Making, Management | Unverified | 0 |
| Cooperative Online Learning with Feedback Graphs | Jun 9, 2021 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Curriculum Design for Teaching via Demonstrations: Theory and Applications | Jun 8, 2021 | Decision Making, Reinforcement Learning (RL) | Code Available | 0 |
| Meta-Learning Reliable Priors in the Function Space | Jun 6, 2021 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Heuristic-Guided Reinforcement Learning | Jun 5, 2021 | Decision Making, reinforcement-learning | Unverified | 0 |
| Be Considerate: Objectives, Side Effects, and Deciding How to Act | Jun 4, 2021 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Learning to schedule job-shop problems: Representation and policy learning using graph neural network and reinforcement learning | Jun 2, 2021 | Decision Making, Graph Neural Network | Unverified | 0 |
| Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning | May 21, 2021 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning | May 21, 2021 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Robust optimal policies for team Markov games | May 16, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Bandit based centralized matching in two-sided markets for peer to peer lending | May 6, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Data-Efficient Reinforcement Learning for Malaria Control | May 4, 2021 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |
| Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization | May 1, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Statistical Inference with M-Estimators on Adaptively Collected Data | Apr 29, 2021 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Universal Off-Policy Evaluation | Apr 26, 2021 | counterfactual, Decision Making | Code Available | 0 |
| Reinforcement Learning using Guided Observability | Apr 22, 2021 | Decision Making, MuJoCo | Unverified | 0 |
| Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | Apr 20, 2021 | Clustering, Decision Making | Unverified | 0 |
| Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images | Apr 14, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning | Apr 9, 2021 | Collision Avoidance, Decision Making | Unverified | 0 |
| Ecole: A Library for Learning Inside MILP Solvers | Apr 6, 2021 | BIG-bench Machine Learning, Combinatorial Optimization | Code Available | 0 |
| Learning-Based UAV Trajectory Optimization with Collision Avoidance and Connectivity Constraints | Apr 3, 2021 | Collision Avoidance, Decision Making | Unverified | 0 |
| Online Convex Optimization with Continuous Switching Constraint | Mar 21, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Forward and Backward Bellman equations improve the efficiency of EM algorithm for DEC-POMDP | Mar 19, 2021 | Computational Efficiency, Decision Making | Unverified | 0 |
| Situated Language Learning via Interactive Narratives | Mar 18, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Encrypted Linear Contextual Bandit | Mar 17, 2021 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Meta-Learning for Planning: Automatic Synthesis of Sample Based Planners | Mar 13, 2021 | Decision Making, Meta-Learning | Unverified | 0 |
| Optimal sequential decision making with probabilistic digital twins | Mar 12, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |