| R-learning in actor-critic model offers a biologically relevant mechanism for sequential decision-making | Dec 1, 2020 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization | Nov 18, 2020 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Modality-Buffet for Real-Time Object Detection | Nov 17, 2020 | Decision Making, Object | Unverified | 0 |
| A New Bandit Setting Balancing Information from State Evolution and Corrupted Context | Nov 16, 2020 | Decision Making, Efficient Exploration | Code Available | 0 |
| Robust Batch Policy Learning in Markov Decision Processes | Nov 9, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Reliable Off-policy Evaluation for Reinforcement Learning | Nov 8, 2020 | Decision Making, Off-policy evaluation | Unverified | 0 |
| Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial | Nov 6, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Loss Bounds for Approximate Influence-Based Abstraction | Nov 3, 2020 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Reinforcement Learning with Efficient Active Feature Acquisition | Nov 2, 2020 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |
| Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach | Nov 2, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |