| Meta-Learning for Multi-objective Reinforcement Learning | Nov 8, 2018 | Computational Efficiencycontinuous-control | —Unverified | 0 |
| Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With Reneging | Oct 29, 2018 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| On preserving non-discrimination when combining expert advice | Oct 28, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Efficient Sequence Labeling with Actor-Critic Training | Sep 30, 2018 | Decision MakingNER | CodeCode Available | 0 |
| Resilient Computing with Reinforcement Learning on a Dynamical System: Case Study in Sorting | Sep 25, 2018 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Geometric Multi-Model Fitting by Deep Reinforcement Learning | Sep 22, 2018 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Predicting Periodicity with Temporal Difference Learning | Sep 20, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games | Sep 10, 2018 | counterfactualDecision Making | —Unverified | 0 |
| Sequential Monte Carlo Bandits | Aug 8, 2018 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game | Jul 17, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |