| Regret Bounds for Online Portfolio Selection with a Cardinality Constraint | Dec 1, 2018 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Structural Causal Bandits: Where to Intervene? | Dec 1, 2018 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making | Dec 1, 2018 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Monte-Carlo Tree Search for Constrained POMDPs | Dec 1, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Tight Bayesian Ambiguity Sets for Robust MDPs | Nov 15, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Meta-Learning for Multi-objective Reinforcement Learning | Nov 8, 2018 | Computational Efficiencycontinuous-control | —Unverified | 0 |
| Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling | Oct 29, 2018 | Collaborative FilteringDecision Making | CodeCode Available | 1 |
| Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With Reneging | Oct 29, 2018 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| On preserving non-discrimination when combining expert advice | Oct 28, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Efficient Sequence Labeling with Actor-Critic Training | Sep 30, 2018 | Decision MakingNER | CodeCode Available | 0 |