| Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning | May 21, 2021 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning | May 21, 2021 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Robust optimal policies for team Markov games | May 16, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Bandit based centralized matching in two-sided markets for peer to peer lending | May 6, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Data-Efficient Reinforcement Learning for Malaria Control | May 4, 2021 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |
| Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization | May 1, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Statistical Inference with M-Estimators on Adaptively Collected Data | Apr 29, 2021 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Universal Off-Policy Evaluation | Apr 26, 2021 | counterfactual, Decision Making | Code Available | 0 |
| Reinforcement Learning using Guided Observability | Apr 22, 2021 | Decision Making, MuJoCo | Unverified | 0 |
| Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | Apr 20, 2021 | Clustering, Decision Making | Unverified | 0 |